智能论文笔记

A Review and Roadmap of Deep Learning Causal Discovery in Different Variable Paradigms

Hang Chen , Keqing Du , Xinyu Yang , Chenguang Li

分类：机器学习 | 人工智能

2022-09-14

了解因果关系有助于构建干预措施，以实现特定的目标并在干预下实现预测。随着学习因果关系的越来越重要，因果发现任务已经从使用传统方法推断出潜在的因果结构从观察数据到深度学习涉及的模式识别领域。大量数据的快速积累促进了具有出色可扩展性的因果搜索方法的出现。因果发现方法的现有摘要主要集中在基于约束，分数和FCM的传统方法上，缺乏针对基于深度学习的方法的完美分类和阐述，还缺乏一些考虑和探索因果关系的角度来探索因果发现方法范式。因此，我们根据变量范式将可能的因果发现任务分为三种类型，并分别给出三个任务的定义，定义和实例化每个任务的相关数据集以及同时构建的最终因果模型，然后审查不同任务的主要因果发现方法。最后，我们从不同角度提出了一些路线图，以解决因果发现领域的当前研究差距，并指出未来的研究方向。

translated by 谷歌翻译

Automatic Semantic Modeling for Structural Data Source with the Prior Knowledge from Knowledge Base

Jiakang Xu , Wolfgang Mayer , HongYu Zhang , Keqing He , Zaiwen Feng

分类：人工智能

2022-12-21

A critical step in sharing semantic content online is to map the structural data source to a public domain ontology. This problem is denoted as the Relational-To-Ontology Mapping Problem (Rel2Onto). A huge effort and expertise are required for manually modeling the semantics of data. Therefore, an automatic approach for learning the semantics of a data source is desirable. Most of the existing work studies the semantic annotation of source attributes. However, although critical, the research for automatically inferring the relationships between attributes is very limited. In this paper, we propose a novel method for semantically annotating structured data sources using machine learning, graph matching and modified frequent subgraph mining to amend the candidate model. In our work, Knowledge graph is used as prior knowledge. Our evaluation shows that our approach outperforms two state-of-the-art solutions in tricky cases where only a few semantic models are known.

translated by 谷歌翻译

Semi-Supervised Knowledge-Grounded Pre-training for Task-Oriented Dialog Systems

Weihao Zeng , Keqing He , Zechen Wang , Dayuan Fu , Guanting Dong , Ruotong Geng , Pei Wang , Jingang Wang , Chaobo Sun , Wei Wu

分类：自然语言处理

2022-10-17

Recent advances in neural approaches greatly improve task-oriented dialogue (TOD) systems which assist users to accomplish their goals. However, such systems rely on costly manually labeled dialogs which are not available in practical scenarios. In this paper, we present our models for Track 2 of the SereTOD 2022 challenge, which is the first challenge of building semi-supervised and reinforced TOD systems on a large-scale real-world Chinese TOD dataset MobileCS. We build a knowledge-grounded dialog model to formulate dialog history and local KB as input and predict the system response. And we perform semi-supervised pre-training both on the labeled and unlabeled data. Our system achieves the first place both in the automatic evaluation and human interaction, especially with higher BLEU (+7.64) and Success (+13.6\%) than the second place.

translated by 谷歌翻译

Distribution Calibration for Out-of-Domain Detection with Bayesian Approximation

Yanan Wu , Zhiyuan Zeng , Keqing He , Yutao Mou , Pei Wang , Weiran Xu

分类：自然语言处理

2022-09-14

室外（OOD）检测是面向任务的对话框系统中的关键组件，旨在确定查询是否不在预定义的支持的意图集之外。事实证明，先前基于软磁性的检测算法对OOD样品被过度自信。在本文中，我们分析了过度自信的OOD来自由于训练和测试分布之间的不匹配而导致的分布不确定性，这使得该模型无法自信地做出预测，因此可能导致异常软磁得分。我们提出了一个贝叶斯OOD检测框架，以使用Monte-Carlo辍学来校准分布不确定性。我们的方法是灵活的，并且可以轻松地插入现有的基于软磁性的基线和增益33.33 \％OOD F1改进，而与MSP相比仅增加了0.41 \％的推理时间。进一步的分析表明，贝叶斯学习对OOD检测的有效性。

translated by 谷歌翻译

Generalized Intent Discovery: Learning from Open World Dialogue System

Yutao Mou , Keqing He , Yanan Wu , Pei Wang , Jingang Wang , Wei Wu , Yi Huang , Junlan Feng , Weiran Xu

分类：自然语言处理

2022-09-13

传统意图分类模型基于预定义的意图集，仅识别有限的内域（IND）意图类别。但是用户可以在实用的对话系统中输入室外（OOD）查询。这样的OOD查询可以提供未来改进的方向。在本文中，我们定义了一项新任务，广义意图发现（GID），旨在将IND意图分类器扩展到包括IND和OOD意图在内的开放世界意图集。我们希望在发现和识别新的未标记的OOD类型的同时，同时对一组标记的IND意图类进行分类。我们为不同的应用程序方案构建了三个公共数据集，并提出了两种框架，即基于管道的框架和端到端，以实现未来的工作。此外，我们进行详尽的实验和定性分析，以理解关键挑战，并为未来的GID研究提供新的指导。

translated by 谷歌翻译

Unified Knowledge Prompt Pre-training for Customer Service Dialogues

Keqing He , Jingang Wang , Chaobo Sun , Wei Wu

分类：自然语言处理

2022-08-31

对话机器人已广泛应用于客户服务方案，以提供及时且用户友好的体验。这些机器人必须对对话的适当域进行分类，了解用户的意图并产生适当的响应。现有的对话预训练模型仅针对多个对话任务而设计，而忽略了弱监督的客户服务对话中的专家知识。在本文中，我们提出了一个新颖的统一知识提示预训练框架，ufa（\ textbf {u} nified Model \ textbf {f}或\ textbf {a} ll任务），用于客户服务对话。我们将客户服务对话的所有任务作为统一的文本到文本生成任务，并引入知识驱动的及时策略，以共同从不同的对话任务中学习。我们将UFA预先训练UFA，从实用场景中收集的大型中国客户服务语料库中，并对自然语言理解（NLU）和自然语言生成（NLG）基准进行了重大改进。

translated by 谷歌翻译

PSSAT: A Perturbed Semantic Structure Awareness Transferring Method for Perturbation-Robust Slot Filling

Guanting Dong , Daichi Guo , Liwen Wang , Xuefeng Li , Zechen Wang , Chen Zeng , Keqing He , Jinzheng Zhao , Hao Lei , Xinyue Cui

分类：自然语言处理

2022-08-24

大多数现有的插槽填充模型倾向于记住实体的固有模式和培训数据中相应的上下文。但是，这些模型在暴露于口语语言扰动或实践中的变化时会导致系统故障或不良输出。我们提出了一种扰动的语义结构意识转移方法，用于训练扰动插槽填充模型。具体而言，我们介绍了两种基于传销的培训策略，以分别从无监督的语言扰动语料库中分别学习上下文语义结构和单词分布。然后，我们将从上游训练过程学到的语义知识转移到原始样本中，并通过一致性处理过滤生成的数据。这些程序旨在增强老虎机填充模型的鲁棒性。实验结果表明，我们的方法始终优于先前的基本方法，并获得强有力的概括，同时阻止模型记住实体和环境的固有模式。

translated by 谷歌翻译

Conditional Diffusion Based on Discrete Graph Structures for Molecular Graph Generation

Han Huang , Leilei Sun , Bowen Du , Weifeng Lv

分类：机器学习

2023-01-01

Learning the underlying distribution of molecular graphs and generating high-fidelity samples is a fundamental research problem in drug discovery and material science. However, accurately modeling distribution and rapidly generating novel molecular graphs remain crucial and challenging goals. To accomplish these goals, we propose a novel Conditional Diffusion model based on discrete Graph Structures (CDGS) for molecular graph generation. Specifically, we construct a forward graph diffusion process on both graph structures and inherent features through stochastic differential equations (SDE) and derive discrete graph structures as the condition for reverse generative processes. We present a specialized hybrid graph noise prediction model that extracts the global context and the local node-edge dependency from intermediate graph states. We further utilize ordinary differential equation (ODE) solvers for efficient graph sampling, based on the semi-linear structure of the probability flow ODE. Experiments on diverse datasets validate the effectiveness of our framework. Particularly, the proposed method still generates high-quality molecular graphs in a limited number of steps.

translated by 谷歌翻译

HUSP-SP: Faster Utility Mining on Sequence Data

Chunkai Zhang , Yuting Yang , Zilin Du , Wensheng Gan , Philip S. Yu

分类：人工智能

2022-12-29

High-utility sequential pattern mining (HUSPM) has emerged as an important topic due to its wide application and considerable popularity. However, due to the combinatorial explosion of the search space when the HUSPM problem encounters a low utility threshold or large-scale data, it may be time-consuming and memory-costly to address the HUSPM problem. Several algorithms have been proposed for addressing this problem, but they still cost a lot in terms of running time and memory usage. In this paper, to further solve this problem efficiently, we design a compact structure called sequence projection (seqPro) and propose an efficient algorithm, namely discovering high-utility sequential patterns with the seqPro structure (HUSP-SP). HUSP-SP utilizes the compact seq-array to store the necessary information in a sequence database. The seqPro structure is designed to efficiently calculate candidate patterns' utilities and upper bound values. Furthermore, a new upper bound on utility, namely tighter reduced sequence utility (TRSU) and two pruning strategies in search space, are utilized to improve the mining performance of HUSP-SP. Experimental results on both synthetic and real-life datasets show that HUSP-SP can significantly outperform the state-of-the-art algorithms in terms of running time, memory usage, search space pruning efficiency, and scalability.

translated by 谷歌翻译

PersonaSAGE: A Multi-Persona Graph Neural Network

Gautam Choudhary , Iftikhar Ahamath Burhanuddin , Eunyee Koh , Fan Du , Ryan A. Rossi

分类：机器学习

2022-12-28

Graph Neural Networks (GNNs) have become increasingly important in recent years due to their state-of-the-art performance on many important downstream applications. Existing GNNs have mostly focused on learning a single node representation, despite that a node often exhibits polysemous behavior in different contexts. In this work, we develop a persona-based graph neural network framework called PersonaSAGE that learns multiple persona-based embeddings for each node in the graph. Such disentangled representations are more interpretable and useful than a single embedding. Furthermore, PersonaSAGE learns the appropriate set of persona embeddings for each node in the graph, and every node can have a different number of assigned persona embeddings. The framework is flexible enough and the general design helps in the wide applicability of the learned embeddings to suit the domain. We utilize publicly available benchmark datasets to evaluate our approach and against a variety of baselines. The experiments demonstrate the effectiveness of PersonaSAGE for a variety of important tasks including link prediction where we achieve an average gain of 15% while remaining competitive for node classification. Finally, we also demonstrate the utility of PersonaSAGE with a case study for personalized recommendation of different entity types in a data management platform.

translated by 谷歌翻译